FLOVA: A Four-issue Media Processor with 3D Graphics Acceleration Units
نویسندگان
چکیده
To process enormous multimedia data, we have designed a VLIW (Very Long Instruction Word) processor called FLOVA (FLOating-Point VLIW Architecture) exploiting the ILP (Instruction-Level Parallelism) in multimedia programs. FLOVA executes four instructions simultaneously in one cycle and supports 136 instructions including 35 media instructions to accelerate multimedia programs. This paper presents FLOVA’s special features that can accelerate transformation and lighting operations in 3D graphics geometry stage. FLOVA can calculate the power value of two floating-point numbers in 4 cycles with small loss of accuracy, while it takes over 150 cycles or requires a large ROM table in other processors.
منابع مشابه
Architectural Support for 3D Graphics in the Complex Streamed Instruction Set
In this paper we extend the previously proposed Complex Streamed Instruction Set (CSI) architecture to provide for floating-point computations and conditional execution in order to efficiently support 3D graphics applications. The CSI extension is evaluated using an industry standard 3D benchmark, and compared to the Intel’s Streaming SIMD Extension (SSE). Compared to a 4-way issue superscalar ...
متن کاملSubword parallelism with MAX-2
pro~rpms. T he genera-purpose computing workload is changing to include more processing of multimedia information. We define media processing as the processing of digital multimedia information, such as images. video, audio. 2D and 3D graphics, animation, and text. This multimedia data, at the lowest component level, tend to be 16 bits or less. However, general-purpose microprocessors are gener...
متن کاملHardware Acceleration for CGP: Graphics Processing Units
Graphic Processing Units (GPUs) are fast, highly parallel units. In addition to processing 3D graphics, modern GPUs can be programmed for more general-purpose computation. A GPU consists of a large number of ‘shader processors’, and conceptually operates as a single instruction multiple data (SIMD) or multiple instruction multiple data (MIMD) stream processor. A modern GPU can have several hund...
متن کاملExploiting Parallelism in Geometry Processing with General Purpose Processors and Floating-Point SIMD Instructions
Three dimensional (3D) graphics applications have become very important workloads running on today’s computer systems. A cost-effective graphics solution is to perform geometry processing of 3D graphics on the host CPU and have specialized hardware handle the rendering task. In this paper, we analyze microarchitecture and SIMD instruction set enhancements to a RISC superscalar processor for exp...
متن کاملgEMfitter: A Highly Parallel FFT-Based 3D Density Fitting Tool With GPU Texture Memory Acceleration gEMfitter: A GPU-Accelerated 3D Density Fitting Tool
Fitting high resolution protein structures into low resolution cryo-electron microscopy (cryo-EM) density maps is an important technique for modeling the atomic structures of very large macromolecular assemblies. This article presents “gEMfitter”, a highly parallel fast Fourier transform (FFT) EM density fitting program which can exploit the special hardware properties of modern graphics proces...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999